Ensemble System of Deep Neural Networks for Single-Channel Audio Separation
نویسندگان
چکیده
Speech separation is a well-known problem, especially when there only one sound mixture available. Estimating the Ideal Binary Mask (IBM) solution to this problem. Recent research has focused on supervised classification approach. The challenge of extracting features from sources critical for method. been accomplished by using variety feature extraction models. majority them, however, are concentrated single feature. complementary nature various have not thoroughly investigated. In paper, we propose deep neural network (DNN) ensemble architecture completely explore complimentary diverse obtained raw acoustic features. We examined penultimate discriminative representations instead employing acquired output layer. learned were also fused produce new vector, which was then classified Extreme Learning Machine (ELM). addition, genetic algorithm (GA) created optimize parameters globally. results experiments showed that our proposed system considered and produced high-quality IBM under different conditions.
منابع مشابه
Single Channel Audio Source Separation using Deep Neural Network Ensembles
Deep neural networks (DNNs) are often used to tackle the single channel source separation (SCSS) problem by predicting time-frequency masks. The predicted masks are then used to separate the sources from the mixed signal. Different types of masks produce separated sources with different levels of distortion and interference. Some types of masks produce separated sources with low distortion, whi...
متن کاملCombining Mask Estimates for Single Channel Audio Source Separation Using Deep Neural Networks
Deep neural networks (DNNs) are usually used for single channel source separation to predict either soft or binary time frequency masks. The masks are used to separate the sources from the mixed signal. Binary masks produce separated sources with more distortion and less interference than soft masks. In this paper, we propose to use another DNN to combine the estimates of binary and soft masks ...
متن کاملDiscriminative Enhancement for Single Channel Audio Source Separation Using Deep Neural Networks
The sources separated by most single channel audio source separation techniques are usually distorted and each separated source contains residual signals from the other sources. To tackle this problem, we propose to enhance the separated sources to decrease the distortion and interference between the separated sources using deep neural networks (DNNs). Two different DNNs are used in this work. ...
متن کاملSingle Channel Audio Source Separation
-Blind source separation is an advanced statistical tool that has found widespread use in many signal processing applications. However, the crux topic based on one channel audio source separation has not fully developed to enable its way to laboratory implementation. The main idea approach to single channel blind source separation is based on exploiting the inherent time structure of sources kn...
متن کاملMonoaural Audio Source Separation Using Deep Convolutional Neural Networks
In this paper we introduce a low-latency monaural source separation framework using a Convolutional Neural Network (CNN). We use a CNN to estimate time-frequency soft masks which are applied for source separation. We evaluate the performance of the neural network on a database comprising of musical mixtures of three instruments: voice, drums, bass as well as other instruments which vary from so...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information
سال: 2023
ISSN: ['2078-2489']
DOI: https://doi.org/10.3390/info14070352